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GENE EXPRESSION IN MONOCYTES AND MACROPHAGES 

5 The present invention relates to the regulatory nucleotide sequences 
associated with the CD68 gene. 

Background to the Invention 
io L ppu$ Control Re gions 

Mammalian gene expression is regulated through cis- linked 
DNA sequences. Promoter sequences lie immediately 5' of the gene's 
transcription initiation site and enhancer sequences can lie within or close 
to the gene which they regulate. In 1987 it was shown that DNA 

15 sequences found within 70 kilobases (kb) 5' of the human, p- globin gene 
on human chromosome 1 1 were able to effect high-level, copy number 
dependent expression of the human p-globin gene in erythroid cells of 
transgenic mice (1) Grosveld, F., etai Cell, 1987, 51, 975-985. These 
DNA sequences which play a key role in the in vivo regulation of globin 

20 gene expression were termed Dominant Control Sequences (OCR) - later 
renamed Locus Control Regions (LCRs). Locus Control Regions have 
been described for other red cell genes such as the genes of the human a- 
gfobin locus (2) Greaves, D.R., et af Cell, 1989, 56, 979-986 and LCRs 
have been described which direct high level gene expression in other cell 

25 types. Examples include the human CD2 gene which has a T-cell specific 
LCR and the 3 1 end of the Ca immunoglobulin heavy chain locus which 
directs high level expression in B-cells (3) Madisen, L and Groudine, M. 
Genes and Development, 1994, 8, 2212-2226. 

30 The CD68 gene is expressed in all macrophage cells. The specificity of 
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expression in viva is high. The origin of this specificity has been 
investigated and surprisingly it has been found that small regions of the 
CD68 gene are responsible. ~ 

5 Summary o f inven tio n 

The present invention therefore provides a cloned or isolated 
polynucleotide having the function of a transcriptional regulatory sequence 
(trs) and comprising: 

10 (a) a polynucleotide fragment having at least 70% identity 

to the polynucleotide of Seq ID No. 2; 

(b) a polynucleotide which is complementary to the 
polynucleotide of (a); or 

(c) a polynucleotide comprising at least 1 5 sequential 
is bases of the polynucleotide of (a) or (b). 

Preferably the polynucleotide fragment has at least 80% 
identity to the polynucleotide of Seq ID No. 2, more preferably at least 90% 
identity to the polynucleotide of Seq ID No. 2. Most preferably the isolated 
20 polynucleotide according to the invention comprises the polynucleotide of 
Seq ID No.2. 

The present invention further provides an isolated 
polynucleotide comprising the transcriptional regulatory sequence of CD68. 

The present invention additionally provides an isolated 
25 polynucleotide comprising the transcriptional regulatory sequence of CD68 
and a polynucleotide operatively linked thereto encoding a heterologous 
polypeptide. 

The present invention also provides a vector comprising a 
polynucleotide as defined herein and a host cell comprising the vector. 
30 The present invention further provides a process for 
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producing a polypeptide which process comprises transforming or 
transfecting a cell with a vector as defined herein such that the eel! 

expresses the polypeptide encoded. ^ ' 

In a further aspect this invention results from the discovery of DNA 
sequences exhibiting the properties of a locus control region associated 
with the CD68 gene. In one aspect the invention provides a vector for the 
integration of a gene into the genetic material of a mammalian host cell 
such that the gene may be expressed in the host cell, the vector 
comprising a promoter and the said gene and a Locus Control Region 
capable of eliciting host cell-type restricted, integration site independent, 
copy number dependent expression of said gene, characterised in that the 
Locus Control Region is located within a region extending from 14kb 
upstream to 25 kb downstream of the CD68 gene. 
Functional definition of an LCR 

Locus Control Regions are fundamentally different from other 
gene regulatory sequences in that they are not subject to position effects 
after integration into host cell chromosomes. Work with the human p- 
globin gene LCR revealed the three criteria which distinguish this class of 
DNA sequence from other regulatory sequences such as promoters and 
enhancers (1,4,5). Grosveld, R, etal Cell, 1987, 51, 975-985., Blom van 
Assendelft, M., etal, Cell, 1989, 56, 969-977., Talbot, D. t etai Nature, 
1989, 338, 352-355. 

1 ) LCRs direct position-independent expression of co-linked 
genes after integration into host genomes either in transgenic animals or in 
cell lines. As a consequence all transgenic animals carrying an intact copy 
of the transgene will express the transgene at a significant level. This is in 
sharp contrast to results obtained with other DNA sequences. 

2 ) LCRs exhibit strict tissue specificity in the pattern of 
transgene expression. A human p-globin gene placed downstream of a 
human 0- globin LCR is expressed only in red cells of transgenic mice 
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(1, 4), The exact same human p-globin gene fragment placed downstream 
of a human CD2 LCR is expressed at high levels in all T-cells but not red 
cells of transgenic animals (2). Greaves, D.R., et al. Cell, 1989, 56,~§79- 
986. 

5 3) LCRs direct high-level, copy number-dependent gene 

expression. Human p-globin transgenes can be expressed as efficiently on 
a per copy basis as endogenous murine p-globin genes in the erythroid 
cells of transgenic mice. When each transgene expresses at the same 
high level this leads to a direct relationship between transgene expression " 

io and transgene copy number. 

LCR's are thus expected to have a region providing chromatin opening 
activity, which region provides at least the first and third activities described 
above. An LCR generally contains at least one transcriptional regulatory 
sequence such as a promoter or enhancer in addition to the sequence 

is which provides chromatin opening activity. 



The genetic diseases p- thalassaemia and sickle cell 
anaemia are caused by mutations within the human p- globin gene locus. 
All the red blood cells of the body are derived from a limited number of 
20 haematopoietic stem cells found in the bone marrow. The demonstration 
that the p-globin LCR was able to direct red cell-specific expression 
regardless of its site of integration into the genome lead us to propose the 
idea of somatic gene therapy for p- thalassaemia and sickle cell anaemia 
by introducing, p-globin LCR vectors into haematopoietic stem cells and 
25 using such transfected celts in bone marrow transplantation. 

This is described in one of the proposed applications set out 
in the original p-globin LCR patents filed in 1987 and 1989 (UK patents 
8718779 & 8904009.1). 

The Locus Control Region (LCR) of the invention may be 
30 located within a region extending from 5.5 kb upstream to 12 kb 
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downstream of the CD68 gene; particularly from 2940 base pairs (bp) 
upstream of CD68 gene to 335 bp downstream (shown in Seq ID No. 3) 
and more particularly within a 3 kb BstX1-BstXHoctrs immediately 
upstream of the CD68 gene. The LCR may be a single continuous 

5 sequence or may consist of two or more such sequences linked together 
with or without intervening polynucleotides. The LCR may consist of, be 
derived from, or correspond to one or more DNAse hypersensitive sites. If 
the LCR of the naturally occurring gene locus comprises two or more 
. discrete sub-sequences separated by intervening non-functional 

io sequences (for example, two or more hypersensitive sites) the vector of the 
invention may comprise an LCR comprising two or more of the sub- 
sequences linked together with all or part of the intervening sub-sequences 
removed. 

The term "vector" as used herein connotes in its broadest 
15 sense any recombinant DNA material capable of transferring DNA from 
one cell to another. 

In another aspect the invention provides a mammalian host 
cell transformed with a vector as defined. The mammalian host ceJI is 
selected from macrophages, monocytes and dendritic cells and their 
20 precursors. 

In other aspects the invention provides: a method of 
producing a polypeptide by culturing a mammalian host cell as defined; a 
method of modifying mammalian host or stem cells by transformation with 
a vector as defined; transformed mammalian host or stem cells for use in 
25 the treatment of a diseased condition in a human or animal body; and use 
of a vector as defined, or mammalian host or stem cells as defined, for the 
manufacture of a medicament for the treatment of a disease condition of 
the human or animal body caused by a gene deficiency. 

Figure? 

30 Figure 1- restriction maps of cosmids containing the human 
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CD68 gene. 

Figure 2 - DNA sequence of the 5' flanking regions of the 
human CD68 gene, (shown also in Seq ID No. T) 

Figure 3 - Northern blot analysis of RNA expression in stably 
5 transfected RAW cells. 

Figure 4 - RT PCR analysis of CD68 and macrosialin RNAs 
in stably transfected RAW cells. 

Figure 5 - Northern blot analysis of RNA expression in stably 
transfected RAW and A20 cells 
io Figure 6 - RT PCR analysis of CD68 and HPRT RNAs in 

stably transfected RAW and A20 cells. 

Detailed Description 

According to the present invention CD68 genes include 
is human CD68, mammalian non-human CD68 t for example mouse, dog t cat, 
rabbit, pig, cow, horse or rat CD68, or non-mammalian CD68. Mouse 
CD68 is known as macrosialin. Preferably the CD68 is human. 

The nucleotide sequences which regulate transcription of 
CD68 gene are contained within the COSMID CD68C1 which is obtainable 
20 from a commercially available human genomic DNA cosmid library 
(Stratagene). 

A transcriptional regulatory sequence (trs) generally 
comprises at least one promoter region and optionally at least one 
enhancer region. A trs may be a single continuous nucleotide sequence or 

25 may consist of two or more such sequences linked together with or without 
intervening polynucleotides. Trs regions are generally 5' (upstream) from 
the coding region but may also be found 3' (downstream) of the ATG start 
codon, for example in regions of coding sequence or in an intron. 
Specifically regions of human CD68 trs have been identified in the first 

30 intron downstream of the ATG start codon. 
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It the trs of the naturally occurring gene locus comprises two 
or more discrete sub-sequences separated- by^rrterveningnnron-functional 
sequences (for example, two or more super hypersensitive sites) the vector 
5 of the invention may comprise a trs comprising two or more of the sub- 
sequences linked together with all or part of the intervening sub-sequences 
removed. 

The vector of the invention may be used to integrate into the genome an 
expression cassette which comprises a CD68 trs and a polynucleotide 
10 operatively linked thereto encoding a heterologous polypeptide. The 
vector may also be used to integrate a chimearic gene. 

In either case the open reading frame or coding sequence of 
the gene or cassette is expressed in the host cell. 

The vector comprises at least a trs and an open reading 
15 frame. An open reading frame may comprise introns. An open reading 
frame may comprise only coding regions. 

A chimearic gene comprises at least a trs and an open 
reading frame. Optionally a chimearic gene will further comprise 
polynucleotide regions which regulate replication, transcription or 
20 translation or any other process important to the expression of the 
polynucleotide in a host cell 

The position of sequences relative to the CD68 gene is 
generally measured relative to the ATG start codon for upstream regions 
and relative to the stop codon for downstream regions. 
25 Isolated or cloned means separate "by the hand of man" from 

its natural state; /,e., that, if it occurs in nature, it has been changed or 
removed from its original environment, or both. For example, a naturally 
occurring polynucleotide or a polypeptide naturally present in a living 

■ 

organism in its natural state is not "isolated," but the same polynucleotide 
30 or polypeptide separate from the coexisting materials of its natural state is 
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"isolated", as the term is employed herein. As part of or following isolation, 
such polynucleotides can be joined to other polynucleotides, such as 
DNAs, for mutagenesis, to form fusiorTproteins, and for propagation or 
expression in a host, for instance. The isolated polynucleotides, alone or 
joined to other polynucleotides such as vectors, can be introduced into 
host cells, in culture or in whole organisms. Introduced into host cells in 
culture or in whole organisms, such DNAs still would be isolated, as the 
term is used herein, because they would not be in their naturally occurring 
form or environment Similarly, the polynucleotides and polypeptides may ~~ 
occur in a composition, such as media formulations, solutions for 
introduction of polynucleotides or polypeptides, for example, into cells, 
compositions or solutions for chemical or enzymatic reactions, for instance, 
which are not naturally occurring compositions, and, therein remain 
isolated polynucleotides or polypeptides within the meaning of that term as 
it is employed herein. 

Expression cassettes themselves are well known in the art of molecular 
biology. Such an expression cassette contains all essential DNA 
sequences required for expression of the heterologous enzyme in a 
mammalian cell. For example, a preferred expression cassette will contain 
a molecular chimaera containing a coding sequence an appropriate 
polyadenylation signal for a mammalian gene (i.e., a polyadenylation signal 
that will function in a mammalian cell), and enhancers and promoter 
sequences in the correct orientation. 

Nomnally, two DNA sequences are required for the complete and 
efficient transcriptional regulation of genes that encode messenger RNAs 
in mammalian cells: promoters and enhancers. Promoters are generally 
located immediately upstream (5 f ) from the start site of transcription. 
Promoter sequences are required for accurate and efficient initiation of 
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transcription. Different gene-specific promoters reveal a common pattern 
of organisation. A typical promoter includes an AT-rich region called a 
TATA box (which is located approximately^^ iJSse^ pairs~~5' to the 
transcription initiation start site) and one or more upstream promoter 
elements (UPEs). The UPEs are a principle target for the interaction with 
sequence-specific nuclear transcriptional factors. The activity of promoter 
sequences is modulated by other sequences called enhancers. The 
enhancer sequence may be a great distance from the promoter in either an 
upstfeam^{5') or downstream (3 f ) position. Hence, enhancers operate in an 
orientation- and position-independent manner. However, based on similar 
structural organisation and function that may be interchanged, the absolute 
distinction between promoters and enhancers is somewhat arbitrary. 
Enhancers increase the rate of transcription from the promoter sequence. 
It is predominantly the interaction between sequence-specific 
transcriptional factors with the UPE and enhancer sequences that enable 
mammalian cells to achieve tissue-specific gene expression. The 
presence of these transcriptional protein factors (tissue-specific, ■ 
trans-activating factors) bound to the UPE and enhancers (cis-acting, 
regulatory sequences) enables other components of the transcriptional 
machinery, including RNA polymerase, to initiate transcription with 
tissue-specific selectivity and accuracy. 

Identity, as known in the art, is the relationship between two or more 
polynucleotide sequences, as determined by comparing the sequences. In 
the art, identity also means the degree of sequence relatedness between 
polynucleotide sequences, as the case may be, as determined by the 
match between strings of such sequences. Identity can be readily 
calculated (Computational Molecular Biology, Lesk, A.M., ed M Oxford 
University Press, New York, 1988; Biocomputing: Informatics and Genome 
Projects, Smith, D.W.. ed„ Academic Press, New York, 1993; Computer 
Analysis of Sequence Data, Part I, Griffin, A.M., and Griffin, H.G., eds., 
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Humana Press, New Jersey, 1994; Sequence Analysis in Molecular 
Biology, von Heinje, G., Academic Press, 1987; and Sequence Analysis 
Primer, Gribskov, M. and DevereuxTJ*. edS^M Stockton Press, New York, * 
1 991). While there exist a number of methods to measure identity 
5 between two polynucleotide sequences, the term is well known to skilled 
artisans (Sequence Analysis in Molecular Biology, von Heinje, G., 
Academic Press, 1987; Sequence Analysis Primer, Gribskov, M. and 
Devereux, J M eds., M Stockton Press, New York, 1991; and Carillo, H., and 
—-LipmanrD./SIAM J. Applied Math:, 48:1073(1988). Methods commonly 
io employed to determine identity between sequences include, but are not 
limited to those disclosed in Carillo, H., and Lipman, D„ SIAM J. Applied 
Math., 48:1073 (1988). Preferred methods to determine identity are 
designed to give the largest match between the sequences tested. 
Methods to determine identity are codified in computer programs. 
15 Preferred computer program methods to determine identity between two 
sequences include, but are not limited to, GCG program package 
(Devereux, J., et al. a Nucleic Acids Research 12(1): 387 (1984)), BLASTP, 
BLASTN, and FASTA (Atschul, S.F. et aL, J. Molec. Biol. 215: 403 (1990)). 

Polynucleotides which have 70% or more identity with the 
20 polynucleotides herein defined may differ from the defined sequences by 
virtue of at least one nucleotide substitution, addition, deletion, fusion or 
truncation in the polynucleotide. 

One of the regions in the CD68 gene which has been found 
to be critical for the macrophage specificity of the CD68 gene is the region 
25 which extends from the ATG start codon to the nucleotide approximately 
80 base pairs upstream (5') to the ATG start codon. This will be referred to 
as the -80bp region. 

A second region which has been found to be critical is the 
region downstream (3') of the ATG start codon of the CD68 gene and 
30 which contains the first PU.1 site downstream of the start codon. A PU.1 
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site is a DNA sequence capable of binding the etf - family transcription 
factor PLU which is expressed in macrophages, neutrophils and B-cells. 

The first PU.1 site downstream of the st^PTcodon is ~ 
comprised within an intron. In human CD68 the intron containing the first 
5 PU.1 site is from nucleotide 32 to nucleotide 137 downstream of the start 
codon. 

The region containing the first downstream PU.1 site and the 
-80bp region when operatively linked provide macrophage specificity. Seq. 
ID No^roruprises the sequence of human CD68 from -80bp to the"ATG 
10 start codon and comprises a PU.1 site and the first intron downstream of 
the ATG start codon. 

A further region which has been found to be critical in 
providing specificity is the region containing the PU.1 site approximately 
1 10bp upstream of the ATG start codon. Thus the nucleotide sequence 
is from the ATG start codon to approximately 150 base pairs upstream from 
the ATG start codon confers enhanced macrophage specific expression. 

A further region which confers specificity is at the PU. 1 site 
which is found approximately 432 base pairs upstream from the ATG start 
codon. Thus the nucleotide squence which extends from the ATG start 
20 codon to approximately 460 base pairs upstream from the start codon 
confers further enhanced macrophage specific expression. 

High macrophage specificity is observed when a gene is 
expressed under the control of any of the above sequences. High 
macrophage specificity may also be obtained when a gene is expressed 
25 under the control of the region which extends from 2940 base pairs 
upstream of the ATG start codon to 335 base pairs downstream of the 
TGA stop codon. 

The first sequence of the first intron of macrosialin has also 
been determined and is also a polynucleotide of the invention. The whole 
30 sequence of the open reading frame of macrosialin is given in Seq. ID No 
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4. Oligonucleotide primers derived from the murine macrosialin cDNA 
sequence were used to PCR amplify a 1759bp fragment from Sv129 
mouse genomic DNA. The genomic PCR-product war cloned into the 
InVitrogen plasmid vector pCRII and the macrosialin gene sequence was 
determined by dye terminator DNA sequencing of double stranded DNA. 
DNA coding sequence is shown in uppercase and underlined, 5' and 3' 
untranslated regions and intervening sequences are shown in lowercase. 
The ATG initiator and TGA termination codons are underlined. 

The present invention therefore provides a-polynucieotide 

comprising 

(a) a polynucleotide fragment having at least 70% identity 
to a polynucleotide as defined herein; 

(b) a polynucleotide which is complementary to the 
polynucleotide of (a); or 

(c) a polynucleotide comprising at least 15 sequential 
bases of the polynucleotide of (a) or (b) 

Preferably the polynucleotide fragment has at least 80% 
identity to a polynucleotide as herein defined, more preferably at least 90% 
identity to the polynucleotide as herein defined. Most preferably the 
isolated polynucleotide according to the invention comprises the 
polynucleotide as herein defined. 

Specifically the present invention provides a 
polynucleotide comprising an isolated polynucleotide comprising: 

(a) a polynucleotide fragment having at least 70% identity 
to the polynucleotide of Seq ID No. 3; 

(b) a polynucleotide which is complementary to the 
polynucleotide of (a); or 

(c) a polynucleotide comprising at least 15 sequential 
bases of the polynucleotide of (a) or (b). 

A preferred nucleotide according to the invention comprises 
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the region containing the first PU1 site and the -80bp region. 

A further preferred nucleotide comprises the region from 
+150 bp to -137 bp relative to the ATG start codon. ~ " " 

A still further preferred nucleotide comprises the region from 
5 +460 bp to -137 bp relative to the ATG start codon. 

The nucleotide sequence of each of these regions may be 
determined from Seq. ID No 3 which shows the sequence of cosmid 
CD68C1 from 2951bp 5* of the CD68 ATG initiation codon (shown in bold 
type) to 2090 bp -3' of the CD68 TGA termination codon (shown in bold 
10 type). Intron sequences are in lowercase, and CD68 coding and 3' 
untranslated regions are underlined. 

Each of the nucleotide sequences of the invention may be 
used in inverted form. 

More than one copy, for example two, three or four copies, of 
is each of the regions described above may be used to control expression of 
a single gene. When more than one copy of a region is used, the copies 
are preferably operably linked. The regions may be used in combination or 
separately. 

The sequence ID No. 2 may be used as a probe to locate the 
20 sequence ID No. 3, which may then be manipulated according to 
conventional techniques using known restriction sites. 

In a search for macrophage-specific gene regulatory 
sequences recombinant cosmids were isolated containing the human 
CD68 gene by PCR screening of a human genomic DNA library in the 
25 cosmid vector pWE15. Human CD68 is the human homologue of mouse 
macrosialin, a protein which is found in the phagosomal compartment of all 
monocytes and macrophages. Two types of human CD68 cosmids were 
obtained which contain the complete CD68 gene with 14 kilobases of 5 1 
flanking sequences and 25 kilobases of 3' flanking sequences (Figure 1 ). 
30 The cosmids were used as probes to demonstrate that the 
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human CD68 gene is located on the short arm of human chromosome 1 7. 
The cosmids were used as templates to determine the DNA sequence of 
the CD68 gene and 1 870bp of the CD68 promoter (Figured). 

The CD68 cosmids were transfected into the murine 
macrophage cell line RAW264.7 and stably transfected RAW cell 
populations and polyclones were selected by growth in G418. Northern 
blot analysis of total RNA isolated from G418 resistant RAW cells was 
performed using a radiolabeled human CD68 gene probe (Figure 3A) and 
a mouse lysozyme probe to control for RNA loading and transfer 
(Figure 3B). Total RNA prepared from human peripheral blood 
mononuclear cell (PBMC) cultures was analysed on the same filter. The 
human CD68 cosmid transfected RAW cells express very high levels of 
human CD68 mRNA of the expected size. The levels of human CD68 
mRNA are higher in cosmid transfected RAW cells than in cultured human 
monocytes. This result was confirmed by analysis of transfected RAW cell 
RNA by Reverse Transcription - Polymerase Chain Reaction (RT-PCR) 
analysis using human CD68 and murine macrosialin specific PCR primers 
(Figure 4). The transfected human CD68 gene is expressed at higher 
levels than the endogenous mouse macrosialin gene. 

The same CD68 cosmid DNAs were transfected into murine 
A20 B- cells an d 3T3 fibroblasts. Northern blot analysis of CD68 RNA 
expression showed levels of CD68 mRNA at least 50 times lower than that 
observed in murine macrophage cell line RAW264.7 transfected with the 
same cosmids (Figure 5). These results were confirmed by RT-PCR 
analysis of transfected cell RNA using CD68 and HPRT specific PCR 
primers (Figure 6). 

Single cell clones from G418 resistant RAW cell populations 
have been isolated and will be analysed to determine if there is a direct 
relationship between transgene copy number and transgene expression in 
transfected RAW264.7 cells which would be predicted if there is a 
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macrophage-specific Locus Control Region in the CD68 cosmids. 

The extremely high level of human CD68 mRNA seen in 
cosmid transfected RAW cells (Figures 3,4 and 5) show that important 
macrophage-specific genetic regulatory elements are contained within the 
5 CD68 cosmids cosCD68C1 and cosCD68G1. Our previous work using the 
human lysozyme promoter in transgenic animal experiments has shown an 
important role for intervening sequences and poly A+ addition sequences 
for transgene expression in macrophages (7,8) Dighe, A.S M etai 
Immunity, 1995, 3 t 657-666. and Clarke, S., etal Proc.Natl. Acad. Sci. 
jo (USA), 1996, 93; 1434-1438.. A role for intervening sequences in 

ensuring efficient transgene expression has been described in a number of 
other systems. 

Development of a macrophage-specific gene expression vector 

is In order to express heterologous genes at a high level in 

monocytes and macrophages it would be desirable to design a CD68 
expression vector. Such a vector should include all the DNA sequences in 
the human CD68 cosmids which direct high-level, macrophage-specific 
gene expression along with the CD68 introns and poly A+ sequences and 

20 unique restriction enzyme recognition sites for the insertion of cDNAs 
encoding heterologous genes of interest. 

To facilitate manipulation of the human CD68 gene 
sequences present in cosmid cosCD68C1 a 20kb EcoRI fragment which 
contains the complete human CD68 gene along with 5.5kb of 5' flanking 

25 and 12,5kb of 3' flanking sequences was cloned into the unique EcoRI site 
of the plasmid vector Bluescript SK- (Stratagene) to give the recombinant 
piasmid pCD68R1A (Figurel). Two further recombinant plasmids were 
derived from pCD68R1A which contain the 5' flanking and 3' flanking 
sequences of the human CD68 gene, pCD68RS1 and pCD68SR1 (Figure 

30 1). In order to engineer the insertion of cDNAs encoding heterologous 
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genes immediately downstream of the CD68 gene promoter and 
immediately upstream of the CD68 gene's ATG initiation codon the plasmid 
pCD68RS1 was digested with the restriction enzyme BstX1 and a 3kb 
BstX1 fragment was cloned into the plasmid vector Bluescript 
SK(Stratagene) after treatment with T4 DNA polymerase which removes 
the CD68 ATG initiation codon (Figure 2). The human CD68 genomic DNA 
fragments present in the recombinant plasmids shown in Figure 1 allow for 
the development of a versatile and easily manipulated human CD68 
expression system for use in macrophage cell lines, transgenic animals 
and human primary cells. 

Table 1 Promoter activity of CD68 prom oter 5 1 deletion seri^ j n 
transiently transfected RAW264.7 and P388.D1 cells 

RAW264.7 or P388.D1 cells were electroporated in the presence of 
20jag of the indicated CAT reporter plasmids and 5jxg of the p- 
galactosidase reporter plasmid pcDNA3 (3-gal. Forty eight hours post 
transfection cell lysates were assayed for p-galactosidase and CAT 
enzyme activity as described in Experimental Procedures. Cell lysate 
CAT enzyme activities were corrected for transfection efficiency and 
the data is expressed as a percentage of the CAT enzyme activity 
obtained with the SV40 promoter/enhancer plasmid pCATControl in the 
same experiment. All cell lysate CAT enzyme activities were within the 
linear range of the assay as determined from a CAT enzyme dilution 
series analysed in the same experiment. The data shown are from a 
single transfection experiment and the relative promoter activities were 
reproducible in at least three independent transfection experiments with 
each construct in both cell lines. 

Table 2 Comparison o f myeloid gene promoter activities in 
transiently transfected murine macrophage cell lines 
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P388.D1 or RAW264.7 cells were electroporated in the presence of 
- 20p,g of the indicated CAT reporter plasmids and 5ng of the p- 
galactosidase reporter plasmid pcDNA3 p-gal. Twenty four hours post 
transfection cell lysates were assayed for p-galactosidase and CAT 
enzyme activity as described in Experimental Procedures. Cell lysate 
CAT enzyme activities were corrected for transfection efficiency and 
expressed as a percentage of the CAT enzyme activity obtained with the 
SV4Q promoter/enhancer plasmid pCATControl in the same experiment. 
All cell lysate CAT enzyme activities were within the linear range of the 
assay as determined from a CAT enzyme dilution series analysed in the 
same experiment. The data shown are from a single transfection 
experiment and the promoter activities were reproducible in at least two 
independent transfection experiments with each construct. 

Tgfrte 3 



Plasmid 

-2940CD68pCAT 

hLZMpCAT 

mLZMpCAT 

CD11bpCAT 

c-fes pCAT 

hMSRpCAT 



Gene 

hCD68 
hLysozyme 
mLysozyme 
hCD11b 
h c-fes 
h MSR 



Promoter 

-2940 to +2" 
-517 to +26 
-487 to +11 
-1706 to +91 
-446 to +71 
-487 to +11 



Accession 
No 

This paper 

X57103 

D13263 

M76724 

X06292 

D13263 



Table 3 shows Myeloid promoters used in this study 

All myeloid gene promoter fragments were cloned into the multiple cloning 
site of the CAT reporter vector pCATBasic (See Experimental Procedures). 
Human genes are denoted by the prefix h and murine genes by the prefix 
m. 

Promoter coordinates shown are taken from the given Genbank accession 
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numbers and the major transcription initiation site is denoted as +1 except 
for 

CD68 where the A of the ATG translation initiation codon is denoted aV+T 
O. 

Ta ble 5 The effect of the CD68 first intron in transiently 
transfected RAW264 7 an d CHO cell lines, 

RAW264.7 cells were electroporated in the presence of 20jig of the 
indicated CAT reporter plasmids and 5^ig of the P-galactosidase 
reporter plasmid pcDNA3 p-gal. Forty eight hours post transfection 
transfected cell lysates were assayed for P-galactosidase and CAT 
enzyme activity. CHO cells were transfected with 4.5^g of the indicated 
CAT reporter plasmids and 0.5jig of the p-galactosidase reporter 
plasmid pcDNA3 p-gal complexed with 50>g of the cationic lipid 
Lipofectamine (Gibco BRL). Cell lysate CAT enzyme activities were 
corrected for transfection efficiency and expressed as a percentage of 
the CAT enzyme activity obtained with the SV40 promoter/enhancer 
plasmid pCATControl analysed in the same experiment. All cell lysate 
CAT enzyme activities were within the linear range of the assay as 
determined from a CAT enzyme dilution series analysed in the same 
experiment. The data shown are from a single transfection experiment 
and the promoter activities were reproducible in at least three 
independent transfection experiments with each construct in each cell 
line. 

Table 4 The effect of the CD68 first intron on CD68 and HIV promoters 



RAW264.7 and P388.D1 cells were electroporated in the presence of 
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20^ig of the indicated CAT reporter plasmids and 5jig of the p- 
galactosidase reporter plasmid pcDNA3 p-gai. Forty eight hours post 
transfection traTisfected cell lysates were assayed for p-galactosidase an? 
CAT enzyme activity. Cell lysate CAT enzyme activities were corrected 
for transfection efficiency and expressed as a percentage of the CAT 
enzyme activity obtained with the SV40 promoter/enhancer piasmid 
pCATControl analysed in the same experiment. All cell lysate CAT 
enzyme activities were within the linear range of the assay as 
determined from-a CAT enzyme dilution series analysed in the same 
experiment. The data shown are from a single transfection experiment 
and the promoter activities were reproducible in at least three 
independent transfection experiments with each construct in each cell 
line. 

METHODS 

Construction of promoter reporter plasmids 

A 2940bp BstXI fragment was purified from an EcoRI - Spel fragment 
subcloned from cosCD68C1. The BstXI fragment was rendered blunt 
ended by incubation with T4 DNA polymerase and all four dNTPs and 
cloned into the EcoRV site of pBluescipt SK- (Stratagene) to give plasmid 
pCD68Bst3-2. The 3' BstXI site contains the CD68 ATG initiation codon 
which was removed by the 3' exonuclease activity of T4 DNA polymerase. 
A Hind III - Xbal fragment containing 2940 bp of DNA upstream of the 
CD68 ATG codon was cloned into the reporter vector pCAT Basic 
(Promega, Genbank accession number X65322) to give the plasmid -2940 
CD68pCAT. Plasmid -2940 CD68pCAT was digested with Xhol, Bglll or 
Sstl and overhanging ends filled in by treatment with the Klenow fragment 
of DNA polymerase or T4 DNA polymerase before ligation of 
phosphoryiated Hindlll linkers. Following digestion with Hindlll and Xbal, 
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5' truncated CD68 promoter fragments were gel purified and subcloned 
into Hindlll and Xbal digested pCATBasic to give the plasmids - 
2258CD68pCAT, -1576CD68pCAT and -951CD68pCAT. All other CD68 ~ 
promoter deletions were prepared by PCR using plasmid -2940 
5 CD68pCAT as a template and 5' oligonucleotide primers which added a 
Hindlll site and a common 3' PCR primer which spanned the Xbal cloning 
site of plasmid -2940 CD68pCAT (listed in Table 1). Amplified fragments 
were digested with Hindlll and Xbal, gel purifed and subcloned into Hindlll 
and Xbal digested pCATBasic to give the plasmids -333CD68pCAT, - 
io 232CD68pCAT, -150CD68pCAT and -80CD68pCAT. The first intron of the 
CD68 gene was PCR amplified using primers 5 1 
ccggaattcTGCTGGGGCTACTGGCAG and 5' 

tgatctagaGTCCCCTGGGCTTTTGGCAG which added EcoRl and Xbal 
sites (underlined). Following EcoRl and Xbal digestion the CD68 intron 

15 fragment was cloned into pCD68Bst3-2 digested with EcoRl and Xbal to 
give plasmid pCD68BstIVS and a 3022bp Hindlll - Xbal fragment was 
cloned into pCATBasic to give the plasmid -2940 IVSpCAT. The HIV 
minimal LTR construct HIV pCAT is from Lew et al. (1991) MoL Cell. Biol. 
11, 182-191. Construct HIV IVS pCAT was made by ligating the EcoRl to 

20 Xba I IVS I fragment of pCD68BstlVS into the unique Bglll site in the HIV 
tar sequence. All CD68 promoter reporter constructs were sequenced 
using M13 reverse and CAT primers and shown to exactly match the CD68 
promoter sequence shown in Figure.A 1.7 kb Hindlll-BamHI (blunt) 
fragment containing CD11b sequences from -1706 to +91 was excised 

25 from the plasmid pB202 (Dzjemmis et al, 1995, Blood, 85, 319-329 and 
cloned between the Hindlll and Xbal (blunt) sites of pCATBasic to give the 
clone CD1 1b pCAT. A 516bp Hind III - Xba I fragment containing c-fes 
sequences from -446 to +71 was excised from the plasmid p446 (a kind gift 
of Celeste Simon, Heydemann et al., 1996) and cloned between the Hind 

30 III and Xba I sites of pCATBasic to give the plasmid c-fes pCAT. Other 
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myeloid gene promoters were cloned by PCR using the primers designed 
from published sequences listed in Table I with cloned DNA or human 
gensmie-DNA templates. A Sma-BamHf fragment containing the SV40 
early splice and polyadenyfation sequences from plasmid pMSG 
(Pharmacia) was ligated into Smal-BamHI digested plasmid pH2KBS 
which contains the 1 .6kb H2K cDNA cloned into the EcoRV site of 
pBluescript (a kind gift of Dr D. Mokophidis) to give the plasmid pH2KSV. 
A 2.5kb Hindlli-BamHI fragment of pH2KSV was rendered blunt ended by 
treatment with T4 DNA polymerase and ligated into Smal digested CD68 
promoter plasmid pCD68Bst3-2 to give plasmid pCD68-H2KSV and the 
same pH2KSV fragment was ligated into Hindi digested human iysozyme 
promoter plasmid pBH7.4 (Clarke et al. 1996) to give hL2M-H2KSV. 
Plasmid pkb-Hindlli contains a 7.4kb Hindlll genomic DNA fragment of the 
H2Kb gene (Weiss et al M 1992). 

Supercoiied plasmid DNAs were prepared from 500ml cultures of E.coli by 
NaOH/SDS lysis and purified by equilibrium centrifugation in CsCI/ethidium 
bromide gradients followed by phenol/chloroform extraction and ethanol 
precipitation (Sambrook, Fritsch & Maniatis 1989). 

Mammalian cell culture and transient transfection 

The murine macrophage cell lines RAW264.7 and P388.D1 were 
maintained in RPM1 1640 medium (Gibco BRL) supplemented with 10% 
heat inactivated foetal calf serum (PCS) (Sigma), 100 units ml-1 penicillin, 
100 tig ml-1 streptomycin, 2mM glutamine and 10mM Hepes (pH 7.0). 
CHO K1 ceils were maintained in Ham's F-12 medium (Gibco BRL) 
supplemented with 10% PCS, antibiotics and glutamine. All cells were 
grown at 37oC in a humidifed incubator in 5% C02/air. RAW264.7 and 
P388.D1 cells were grown to confluence in T175 flasks, harvested in 
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Phosphate Buffered Saline (PBS) and washed once and resuspended in 
Optimem 1 serum free medium (Gibco BRL) for RAW cells or RPMI 1640 
(no FCS) for P388D1 cells . Cells were counted and adjusted to a final — 
density of 4 x 107 cells ml-1, Aliquots of 2 x 107cells (0.5mt) were mixed 

5 with 50 jag CAT reporter plasmid DNA and 5 \xg pcDNA3 b-galactosidase 
plasmid DNA, added to a 0.4cm electrode gap electroporation cuvette 
(BioRad) and shocked in a BioRad GenePulser (300V, 960 jiFD) at room 
temperature. Cells were recovered immediately into 10ml of cell growth 
medium which- had been pre-warmecHo-37oC and plated into 35mm and 

10 9cm diameter tissue culture petri dishes (Nunc). Cells were analysed 24 or 
48 hours post electroporation for transfection efficiency by staining fixed 
permeabilised cells with 5-bromo-4-chloro-3-indoJyl-p-D^alactopyranoside 
(X-gal, Sigma) as described (Hogan et at. 1994). Transient transfection 
efficiencies of between 20 and 30% were routinely obtained with P388.D1 

15 murine macrophages and after optimisation transient transfection 

efficiencies in excess of 40% could be obtained with RAW264.7 cells. CHO 
cells were grown to 70-80% confluence in 9cm petri dishes, washed twice 
with Optimem before addition of 5ml of plasmid DNA:cationic lipid complex 
(5 jig DNA:50 ^g Lipofectamine (Gibco BRL) in Optimem). After 6-16 

20 hours incubation the medium was aspirated, cells were washed twice with 
PBS and recovered into complete medium for 24-36 hours before analysis. 
X-gal staining and FACS analysis routinely showed CHO cell transient 
transfection efficiencies in excess of 40%. 

25 Reporter gene assays 

Transfected cells were harvested by scraping in PBS, 
washed once with PBS and cell pellets were resuspended in 100jil 0.25M 
Tris-HCl (pH 7.8) and subjected to three rounds of freeze thaw lysis. Cell 
lysates were assayed for p-galactosidase enzyme activity using the 

30 colorimetric substrate chlorphenolred p-D-galactopyranoside (CPRG, 
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Boehringer Mannheim) in a 96 well plate assay in 50mM potassium 
phosphate buffer (pH7.3) with 2mM MgCI2. Enzyme activity was 
determined by spectrophotometry at 570nm after 30 minutes incubation at 
37oC using dilutions of purified Exoli p-galactosidase enzyme (Sigma) to 
generate .a standard curve. CAT enzyme activity of cell lysates was 
determined after heat treatment (65oC, 20 minutes) using 10 fiCi 14C 
labeled chloramphenicol (54Ci mmol-1, Amersham) as substrate in a 125pJ 
reaction in 0.25M Tris-HCI (pH 8.0) containing 0.2 mg ml-1 n-butyryl CoA 
as cofactor. CAT enzyme activity was measured by determining the 
amount of butyryl-14C chloramphenicol extracted into mixed xylenes 
(Aldrich) after a 2 hour incubation at 37oC (Seed ref). A CAT enzyme 
standard curve was generated using dilutions of purified CAT enzyme 
(Promega) in each experiment and all CAT enzyme assays using 
transfected cell extracts were within the linear range of the enzyme assay. 

Figure Restriction maps of cosmids containing the human CD68 gene. 

Recombinant cosmids containing the human CD68 gene in 
the vector pWE1 5 were isolated by PCR screening of pools of robotically 
picked Ampicillin resistant HB101 colonies using PCR primers CD68 L1 & 
CD68 L2. CD68 PCR positive single colonies were used to prepare 
cosmid DNA which was analysed by restriction mapping and Southern 
blotting using radioactively labelled CD68 gene probes. Sites for the 
restriction enzymes Spe I (Spe) r C!a I and Not I are shown. Not I sites in 
brackets (Not I) are derived from the pWE15 cosmid vector cloning site and 
flank the human genomic DNA insert. 

Below the map of cosmid cosCD68C1 is shown the position 
of a 20kb EcoRI fragment cloned into the plasmid vector pBluescriptSK- to 
give the recombinant piasmid pCD68R1A. The positions of other 
cosCD68C1 restriction fragments cloned into the plasmid vector 
pBluescriptSK- are also indicated. The 3kb BstXI fragment the 3* end of 
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which contains the CD68 gene ATG initiation codon is indicated. 

Figure 2 . DNA sequence of the 5' flanking region of the human CD68 
gene. 

5 The DNA sequence of the 5* flanking region of the CD68 

gene was determined by double strand sequencing using the CD68 
cosmids as template and oligonucleotide primers derived from the 
published human CD68 cDNA sequence (17).The initiator Methionine 
encoding ATG codon of the CD68 gene which is contained within a BstXI 

10 restriction enzyme recognition site is boxed. CD68 coding regions are 
underlined and intervening sequences inferred from comparison with the 
published CD68 cDNA sequence are delimited by vertical arrows. The 
sequence has not yet been confirmed on both strands, ambiguities in 
multiple sequence reads are indicated by lowercase letters according to 

15 the IUPAC-IUB standards described in Nuc. Acids. Res. 13, 3021-3030 
(1985). Dots in the sequence indicate a chemical bond. 

Figure 3 . Northern blot analysis of RNA expression in stably transfected 
RAW cells. 

20 Total RNA was prepared from G41 8 resistant RAW cell 

populations (derived from at least 10,000 independent G418 resistant 
colonies) or polyclones (derived from 50 -100 independent G418 resistant 
colonies). RAW cells were transfected with 10 \ig of supercoiled or Pvu I 
linearised CD68 cosmids or a pcDNA3 CAT plasmid which confers 

25 resistance to G418. 

Total RNA (10 \xg) was denatured with formaldehyde, 
subjected to agarose gel electrophoresis, transferred to nylon membranes 
and hybridised with radioactively labelled human CD68 and mouse 
lysozyrne probes. For comparison total RNA (5 ^ig) prepared from human 

30 THP1 cells treated with PMA for 24 hours and human PBMC cultures was 
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analysed. A 6 hour autoradiographic exposure is shown. 

Figure 4 . RT PCR analysis of CD68 and macrosialin RNAs in stably 
transfected RAW cells. 

5 RNAs (1 0 \ig) prepared from the G41 8 resistant RAW cell 

populations analysed by Northern blotting in Figure 3 were used to prepare 
oligo dT-primed cDNA in a reverse transcription (RT) reaction in a final 
volume of 100 Control RT reactions omitting reverse transcriptase were 
performed using the same RNA samples (-RT). A 30 cycle Polymerase 

io Chain Reaction (PCR) using macrosialin and CD68- specific primers was 
performed using 1 \i\ of the neat RT reaction or indicated dilutions as 
template. The RT PCR products were analysed by agarose gel 
electrophoresis and the position of mouse macrosialin and human CD68 
PCR products of the predicted sizes are indicated. 

15 

Figure 5 . Northern blot analysis of RNA expression in stably transfected 
RAW and A20 cells. 

RAW and A20 B-cells were transfected with 10 \ig of 
supercoiled or Pvu I linearised CD68 cosmids or a pcDNA3 CAT plasmid 

20 which confers resistance to G418. Total RNA (10 \ig) prepared from G418 
resistant cell populations was denatured with formaldehyde, subjected to 
agarose gel electrophoresis, transferred to nylon membranes and 
hybridised with a radioactively labelled human CD68 probe. For 
comparison total RNA prepared from human THP1 cells treated with PMA 

25 for 24 hours (5 p.g) and human PBMC cultures (2 jig) was analysed. A 6 
hour autoradiographic exposure is shown. 

Figure 6 . RT PCR analysis of CD68 and HPRT RNAs in stably 
transfected RAW and A20 cells. 
30 RNAs (1 0 ^ig) prepared from the G41 8 resistant RAW cell 
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populations analysed by Northern blotting in Figures 3 & 5 were used to 
prepare oligo dT-primed cDNA in a reverse transcription (RT) reaction in a 
final volume of 100 ul A 30 cycle Polymerase Chain Reaction (PCR) 
CD68- and HPRT- specific primers was performed using 1R1 of the neat 
5 RT reaction or indicated dilutions as template. The RT PCR products were 
analysed by agarose gel electrophoresis and the position of human CD68 
PCR and HPRT PCR products of the predicted sizes are indicated. 



Table 1 

10 

This table shows the effect of deleting CD68 5' promoter sequences in 
RAW and P388.D1 transfections 



-2940pCAT 


38.6 


78.4 


-666pCAT 


37.5 


74.4 


-575pCAT 


48.4 


129.5 


-460pCAT 


71.6 


132.2 


-333pCCAT 


30.5 


112.3 


-232pCAT 


34.4 


96.4 


-150pCAT 


11.6 


176 


-80pCAT 


0.64 


9.5 


pCATBasic 


0 


2 


pCATControl 


100 


100 


Plasmid construct 


RAW264.7 


P388.D1 



Table 2 

15 



This table compares the level of CAT reporter gene activity of plasmids 
with different myeloid gene promoters in P388.D1 and RAW264.7 
pCATControl 100 100 
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pCAT Basic 


0 


8.4 


-2940 CD681VS 


117 


163.9 


-2940 CD68 pCAT 


39 


50 


hLZM pCAT 


17.6 


66 


mLZM pCAT 


20.1 


73.3 


CD11bpCAT 


42 


23 


-c-fes pCAT 


4.8 


47.3 


hMSR pCAT 


3 


35.5 



CAT Reporter RAW264.7 P388.D1 

Plasmid 

Table 4 

This table compares the level of CAT reporter gene activity of plasmids 
with CD68 and HIV minimal LTR promoters in P388.D1 and RAW264.7 
cells 



pCAT Control 100 100 

pCAT Basic 0 8.4 

-2940CD681VS 117 163.9 

-2940 CD68pCAT 39 50 

HIVlVSpCAT 0.8 15 

HIVpCAT 0.8 69 

CAT Reporter RAW264.7 P388D1 
Plasmid 



Table 5 

This table shows the effect of adding the CD68 IVS intron on to different 
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CD68 promoter fragments in RAW and CHO cells 



pCAT Control 


100 


100 


pCAT Basic 


0 


0 


-2940 CD68pCAT 


31 


56 


-2940 IVS 


154 


63 


-80 pCAT 


0.6 


5 


-80 IVS 


53 


1.9 



Plasmid construct RAW264.7 CHO 



Potential App lications for the CD68 LCR 

The Macroph a ge as a del ivery vehicle for gene therapy 

Macrophages have several important advantages over other 
cell types for deiivering therapeutic gene products in a range of important 

human diseases 

Macrophages have a high biosynthetic capacity. 
Macrophages secrete physiologically significant amounts of cytokines, 
growth factors, inflammatory mediators, proteases, protease inhibitors and 
other important biologically active macromolecuies. 

Macrophages have a limited life span. 
After commitment tissue resident and recruited macrophages undergo only 
one or two cell divisions at most, this would be a distinct advantage in 
many human gene therapy protocols. 

Macrophages are found in virtually all tissues. 
The presence of macrophages in virtually every tissue in the body in 
significant numbers increases the utility of an LCR which directs 
macrophage-specific gene expression. The presence of macrophages in 
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the lung and gut allows for recombinant DNA delivery to macrophages by a 
number of different routes. 

Macrophages are rapidly recruited into sites of inflammation. 
The ability to direct heterologous gene expression in a cell type which is 
recruited to sites of inflammation offers unique avenue for therapeutic 
intervention in chronic inflammatory disease. 
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Somatic Gene Therapy 

There are currently no human monogenic disorders described which 
specifically affect macrophage function which would be candidates for 
"classical" somatic gene therapy. However the CD68 LCR could be used 
5 to drive the expression of therapeutic gene products in human 

macrophages in a whole range of important human diseases. Candidate 
genes would include: Local delivery of soluble IL-1Ra to inflammed rat 
knee joints by recombinant retroviral vectors has been shown to be 10 000 
times more efficient than systemic administration of slL-1RA in reducing 

io experimentally-induced arthritis (9) Makarov, S.S., et al Proc. Natl. Acad. 
ScL (USA), 1996, 93; 402-406. 

Currently there is much interest in anti TNF-a therapy in 
treatment of toxic shock syndrome and a number of autoimmune diseases 
such as rheumatoid arthritis which afflicts 1% of the UK population. 

15 Many important human diseases are the result of a failure in regulation of 
the immune system. Macrophages and cytokines secreted by 
macrophages such as IL-12 and IL-10 play a key role in regulating 
T-lymphocyte function. Macrophages-specific expression of trans- 
dominant negative IL-4 receptors could find application in autoimmune 

20 diseases such as Ulcerative Colitis and Crohn's Disease. 

Apo E and several other gene products have been proposed 
to play a key role in the development of atherosclerotic plaques which are 
the cause of blood vessel occlusion in atherosclerosis and strokes. Apo E 
expressed in macrophages present in atherosclerotic plaques might find 

25 application in the treatment of vascular occlusion (1 0). CD68 is present in 
the macrophage derived foam cells found in human disease tissue and we 
have shown the mouse homologue of CD68, macrosialin, to be present at 
high levels in the atherosclerotic pJaques of an apo E" A mouse model of 
atherosclerosis. 

30 The murine homologue of CD68, macrosialin has been 
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reported to bind oxidised LDL (11). Overexpression of human CD68, 
murine macrosialin or human Macrophage Scavenger Receptor in 
macrophages present in atherosclerotic plaques may usefully reduce the 
amount of atherogenic oxidised LDL in atherosclerotic lesions. 

Chronic Granulomatous Disease (CGD) is a genetic disease 
caused by mutation of the NAPDH oxidase gene. CGD patients fail to 
make reactive oxygen metabolites in their phagocytic cells which kill 
bacteria and hence patients suffer from recurrent bacterial infections. 
Expression of NAPDH oxidase in macrophages of CGD patients by 
somatic gene therapy would be highly beneficial (12). 

There are several relatively rare human genetic diseases 
which can be treated by providing purified proteins which can be taken up 
by defective cells to correct their genetic defect. Examples include Beta 
cerebrosidase in patients with Gaucher's disease and other genes 
defective in lysosomal storage disorders (13). A particular attraction of the 
CD68 LCR for treatment of lysosomal storage diseases is the fact that 
CD68 is expressed in microglia, mononuclear phagocytic cells resident in 
the brain. Some microglial cells in adults are derived from recruited blood 
monocytes and hence the CD68 LCR may offer the possibility of 
expressing therapeutic gene products in the brain via recruited blood 
monocytes transfected with CD68 LCR vectors. 

Treatment of infection s diseases 

Several important human pathogens survive and replicate in 
the endosomal compartment of macrophages. These include the 
causative agents of tuberculosis, leismaniasis and leprosy. The HIV virus 
survives and replicates in monocytes. Targeting y-interferon production to 
macrophages infected with Mycobacterium bovis could be a viable strategy 
for treatment of Tb. The delivery of HIV decoy tar sequences and other 
anti HIV reagents to monocytes and macrophages would create a pool of 
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monocytes resistant to HIV infection. Similar gene therapy protocols for 
the "intracellular vaccination" of T-lymphocytes have been proposed. 

Macrophage Expression Systems 

The original p- giobin LCR vectors have been used to 
develop expression systems for the over production of heterologous gene 
products in cultured erythroid cell lines(14). The CD68 LCR could be used 
for over expression of heterologous genes in human and murine 
macrophage cell lines for instance in the production of cytokines and 
soluble receptors whose pattern of glycosylation was important for their 
biological activity. 

Genetic Vaccination 

Recently it was shown that naked DNA could be used in 
animals to confer protective immunity against lethal virus challenge and to 
elicit significant humoral antibody responses. Current protocols for genetic 
vaccination use standard mammalian expression vectors based on the 
SV40 or HCMV promoters and enhancers. Heterologous genes are 
expressed in myofibres but the cells presenting foreign antigen for the 
elaboration of an effective immune response are unknown (15). Using a 
CD68 LCR vector to target heterologous gene expression to macrophages 
and possibly dendritic cells should increase the efficiency and efficacy of 
genetic vaccination protocols. 

Immunotherapy 

Dendritic cells process antigens for presentation to cells of 
the immune system. Human dendritic cells expressing the CD68 antigen 
are important in conferring tolerance in organ transplant rejection and 
autoimmune diseases. A CD68 LCR will allow for the genetic targeting of 
an important subset of Dendritic cells grown in culture from bone marrow or 
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peripheral bfood precursors. 
Compositions 

The invention also relates to compositions comprising the 
polynucleotide, vector or transfected ceil. Thus, the polypeptides of the 
5 present invention may be employed in combination with a non-sterile or 
sterile carrier or carriers for use with cells, tissues or organisms, such as a 
pharmaceutical earner suitable for administration to a subject. Such 
compositions comprise, for instance, a media additive or a therapeutically 
effective amount of a polypeptide of the invention and a pharmaceutical^ 
10 acceptable carrier or excipient. Such carriers may include, but are not limited 
to, saline, buffered saline, dextrose, water, glycerol, ethanol and 
combinations thereof. The formulation should suit the mode of 
administration. 

Kits 

is The invention further relates to diagnostic and pharmaceutical packs 

and kits comprising one or more containers filled with one or more of the 
ingredients of the aforementioned compositions of the invention. Associated 
with such containers) can be a notice in the form prescribed by a 
governmental agency regulating the manufacture, use or sale of 

20 pharmaceuticals or biological products, reflecting approval by the agency of 
the manufacture, use or sale of the product for human administration. 
Administration 

Polynucleotides and other compounds of the present invention may 
be employed atone or in conjunction with other compounds, such as 
25 therapeutic compounds. 

The pharmaceutical compositions may be administered in any 
effective, convenient manner including, for instance, administration by 
topical, oral, anal, vaginal, intravenous, intraperitoneal, intramuscular, 
subcutaneous, intranasal or intradermal routes among others. 
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The pharmaceutical compositions generally are administered in an 
amount effective for treatment or prophylaxis of a specific indication or 
indications. In general, the compositions are administered in an amount of 
at least about 10 pg/kg body weight. In most cases they will be administered 
in an amount not in excess of about 8 mg/kg body weight per day. 
Preferably, in most cases, dose is from about 10 pg/kg to about 1 mg/kg 
body weight daily. It will be appreciated that optimum dosage will be 
determined by standard methods for each treatment modality and indication, 
taking into account the indication, its severity, route of administration, 
complicating conditions and the like. 

In therapy or as a prophylactic, the active agent may be 
administered to an individual as an injectable composition, for example as 
a sterile aqueous dispersion, preferably isotonic. 

Alternatively the composition may be formulated for topical 
application for example in the form of ointments, creams, lotions, eye 
ointments, eye drops, ear drops, mouthwash, impregnated dressings and 
sutures and aerosols, and may contain appropriate conventional additives, 
including, for example, preservatives, solvents to assist drug penetration, 
and emollients in ointments and creams. Such topical formulations may 
also contain compatible conventional carriers, for example cream or 
ointment bases, and ethanol or oleyl alcohol for lotions. Such carriers may 
constitute from about 1% to about 98% by weight of the formulation; more 
usually they will constitute up to about 80% by weight of the formulation. 

For administration to mammals, and particularly humans, it is 
expected that the daily dosage level of the active agent will be from 0.01 
mg/kg to 10 mg/kg, typically around 1 mg/kg. The physician in any event 
will determine the actual dosage which will be most suitable for an 
individual and will vary with the age, weight and response of the particular 
individual. The above dosages are exemplary of the average case. There 
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can, of course, be individual instances where higher or lower dosage 
ranges are merited, and such are within the scope of this invention. 

A vaccine composition is conveniently in injectable form, or 
may be administered by other techniques, for example a gene gun using, 
5 for example, gold particles, or §x vivo Conventional adjuvants may be 
employed to enhance the immune response. 

A suitable unit dose for vaccination is 0.5-5^g/kg of antigen, and 
such dose is preferably administered 1-3 times and with an interval of 1-3 
weeks. 

10 With the indicated dose range, no adverse toxicological effects wilt 

be observed with the compounds of the invention which would preclude 
their administration to suitable individuals. 

The polynucleotide sequences, vectors and host cells of the present 
invention may be used in the treatment of infectious diseases including 
15 viral and bacterial infections such as HIV and TB, inflammatory diseases, 
cardiovascular diseases, rheumatoid arthritis, atherosclerosis, restinosis, 
cancer, for example cancer of the bowel, colon, breast or lung. 

Other ucp Pate nts 

20 The published LCR most similar in targeting transgene 

expression to macrophages is the Class li LCR . This element has been 
claimed to direct expression to (presumably activated) macrophages, 
dendritic cells and B-lymphocytes (16). Our data with stably transfected 
murine macrophage line RAW 264.7 and the murine B-cell line A20 already 

25 show the CD68 cosmids direct expression which is restricted to 
macrophages (Figures 6 8 6). 
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Seq ID No. 1 

REVERSE- COMPLEMENT of: Drg 237. Con check: 1236 from: 1 to: 3601 
Drg237.rev Length: 3601 April 29, 1996 11.13 Type N Check: 6126 

1 TGTCCTGGAA CCCAGGTGCC TACCTGGTCT GCTGCATATT TGTTTTCTCT 

51 TCCAGCATGG AGATATGGnA CCAAAAGGAA CGAGTGCTCA GAGTTTTGAT 

101 TACCAnTGAC CTGCTGGTGA GTAGAGGGAA CTGATAGCAA AGGCAGAAGG 

151 GAGGATCCAA GGTGATTCCC TCTCCAAGGC AAGTTCGGAA AGTAGCAGCT 

201 TGGAATAGAA TCTGGCATGC CTAAGGCCTT TGGGGAACTG GGATG CTTAT 

251 TTCCTCTGCC TTCCTTGGCT GCCCACATGG ATGCCTAAGT GTCTTCCCTC 

301 CGGGATAGAG TGTCCTCCGT GCACATGCTG AAGAGTTGTC TTTCTTGACG 

351 TAGGCCAGAG GCATTGATGT GCAGCAGGTT TCTTTAGTCA TCAAcTATGA 

401 CCTTCCCACC AACAGGGAAA ACTATATCCA CAGGTAAGCG TAGATCTGGA 

451 ACATTCCCAn ACCCTTTCAC ACcTGGCCcT CCCTGGGCTT AAAGCTCCTG 

501 ATATTCCTCA TCCCCTTCCT TGTTTTCCAG AATCGGTCGA GGTGGACGGT 

551 TTGGCCGTAA AGGTGTGGCT ATTAACATGG TGACAGAAGA AGACAAGAGG 

601 AyTcTTCGAG ACATTGAGAC cTTCTACAAC ACCTCCATTG AGGAAATGCC 

651 CCTCAATGTT GCTGACCTCA TCTGAGGGGC TGTCcTGCCA CCCAsCCCCA 

701 gCCAsgGcTC AAkyTcTGGG GGCTGAGGAk CwgCAGGAGG GGGGAGGGAA 

751 GGGAGCCAAG GGATGGACAT CTTGTcAtTT TTTTTtCTTT GAATAAATGT 

801 CACTTTTTGA GGCAAAAGAA GGAACCGTGA ACATTTTAGA CACCCTTTTC 

851 TTTGGGGTAG GCTCTTGCCC CAGGCGCCGG CTCTTCTCCC AAAAAAAAAA 

901 AAAAAACAcT AaTCCATTTC CCTAACcTAg TAACcTCCAG ATCCCAGAGG 

951 CTCTCCTCAC CTCAGCTGAG CTCCTTTGAA AGTGATTCAA GGGAcTATGT 

1001 CAcTCAgCcT CATTTGcTGG ACCAAATcTG GAGGGAGAAC CCCTAAAACC 

1051 CcTAAGTGAG GTTGCCCAGG GGGTTGTCCC CAGGTGGGGG GAAGCAGGGG 

1101 • AGAGAAAATG GTAGCCATTT TtACATTGTT TTGTATAGTA TTTATTGATT 

1151 CAGGAAACAA ACACAAAATT cTGAATAAAA TGACTTGGAA ACTGCCTGTT 

1201 TGGGCTTCTC ATTTCTtACC TCCCCTTCCC TCTCCCACCT GmTAcTGGGT 
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1251 


GCATcTCTGC 


T\ CCCCCCTT 


1301 


GCTTTCTTGT 


CACCATCTGA 


1351 


GCTCTGTGAA 


tGACAATGcT 


1401 


GGGTGTGGAT 


GGTCACAAGC 


1451 


tACAGGggTC 


AGCCCAaAaq 


1501 


CCACCAA . GG 


GAaGGGCcTG 


1551 


CAACTGCCAA 


G CAAcr TTTCG 


1601 


TCAAcTGCCc 


TAcrG Ar*TP c Q 


1651 


CCTG Ac aAa 


y v— «y y y v^v-ria. 


1701 


G Ac T AAG Aa A 


y Aiy o x V3uu m 


1751 


TGGGATCaTc 


tCCAgTACAG 


1801 


Ag.AgAgGGc 


TGAgGGAgCa 


1851 


TGGActTTGG 


GTgAgGc . . g 


1901 


cGGGGGcCCT 


GcTGGGGcTA 


1951 


GAGGGGG . . c 


CCCTGGGAGG 


2001 


T * * » # 0*1* 


GCCAAAAGCC 


2051 


AAAATCAGcT 


ACTTtGcTGC 


2101 


AGAGCACTGG 


AACAAcCAGC 



.CCCCAGCAG ATGGTTACCT TTGGGCTGTT 
GTTCTCAGAC GCTGGAAAGC CATGTTCTCG 
GAcTGGAGTG CTGCCCCTCT GTAAAGGGcT 
CCtTcACATG CyTCAGCCAA GAgGAAGTAG 
TCCAGGGGAA AgGAgtgGAA AcCGATTTCC 
TACcTCAgcT GTTCCCATAg cTACTTGCCA 
cTGAgTTtGA CACATGG . AT CCC.TGTGGA 
TTTGCACCCA TgtgACacTG ttGAcTTTGC 
cagtccccta AcTTAATtAC aAaAAcTAAT 
gAgCTGAgGC CCCTG,AgTC AgGcTGTGGG 
GAAgtGAGAc TTTCATTT.C ctCCtTTcCA 
gGGTTgAgCa ActGGTGCAg ACAgCCTAgc 
gtTCAgCCAT gAgGctGGcT gTgCttTtcT 
cTGGCAGGTA AGGAGGAAgG AgGcTGAGGG 
GAGCcTG.CC CTGGGTTGct AACCATcTCC 
CAGGGGACAG GGAATGAC . T GTCCTCACAA 
CATCCtTCAC GGTGACACCC ACGGtTACAG 
CACAGGACTA CCAAgAGCCA CAAAACCACC 
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Seq ID No 2 

TTCCAAGAGAGGGCTGAGGGAGCAGGGTTGAGCAACTGG 

TGCAGACAGCCTAGCTGGACTTTGGGTGAGGCGGTTCAGCC AGGAATCC 
TGCTGGGGCTACTGGCAG gtaaggagCCCAG 

gaaggaggctgaggggagggggcccctgggagggagcctgccctggg 
ttgctaaccatctcctctctgccaaaagCCCAG 
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aattcggttctccaatcccctgggtcactttgctcttgtgcacgctttcc 
agtctttcagcgtaagccagagtcattcccaaggatgctggtttctctct 
gggggaagagctgctctgtgatggagcccatgcgtgtcatctgagcctct 
ggcttccctgccagtgcagccctggcagtgtcctacttcccagggctgtt 
gtctgcctggcgggaaggtcctgggcaaaggatcagtctttgtactctga 
gagcagactacttggctcctctctgttttttatcagcgaagttggatata 
tctctcccacatttccctaatcatatgctatatattggctttttttttct 

tctctaa CCCCCAAATACATCAAGATGTTTGTACTGGATGA AG fTG A CG A 
AATGTTAAGCCGTGGATTCAAGGACCAGATCTATGACATATTCCAAAAGC 

TCAACAGCAACACCCAGgtgagggcagtcttgcttgaatagctaatgatt 
cttgaaaaatagtaagtgccaggggaaaccaaatactggattcttgagcc 
tttttatgcatctgcttcagttttaggtgtggctagggaagggagcaggc 
ctcaggaaggaaccagcactctaagactggcctttttttccactagSI£^ 
TTTTGCTGTCAgCCAC^TGCCTTCTGATgTgCTTgAggTgACCMgMg 
TTCATGAGGGACCCCATTCGGATTCTTgTCAAGAAGGAAGAGTTgACCCT 

GGAGGGT AT C CG C CAGTT CT A C AT CAACGTGG AA CG AG AG a t oogg ccca 
gtgcaggaggcgggcctggtagtgagttgttgggtatagcccctgactga 

1 1 1 1 1 g t c cc ccaac c t c c a gG agtgg aagctggacacact at gtgactt 
GTATGZWHCCCTGftCCATCACCCAGGCAGTCATCTTCATCAAGACCCGGft 

ggaaggtggactgggtcaccgagaagatgcatgctcgagatttcactgta 

TCCGCCATG gt-gtgttitgcccactaccaacctattataqatctacccatc 
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agaagtgtcctacttgaagccagggttcctggaacccaggtgcctacctg 
cftctqctqcatatttattttctcttccaaCATC3flAnATATnr;apraftftftf3 

GAACGAGACGTCATTATr, AGGGAGTTTCaTTCTG!P!rTCTAGrAnAr,TTTT 

GATTACCftCTGA^CTGrTGgtgagtagagggaactgatagcaaaggcaga 
agggaggatccaaggtgattccctctccaaggggacatcagtgcctctca 
ggaaagtagcagcttggaatagaatctggcatgcctaaggcctttgggga 
actgggatgcttatttcctctgccttccttggctgcccacatggatgcct 
aagtgtcttccctccgggatagagtgtcctccgtgcacatgctgaagagt 

tgtctttCttqacqtaqGCCAGAGGCATTGATGTGrAn CAGGTTTrTTfA 
GTCATCAACTATGACCTTCrCACCAAGAGG GAAAArTATATrrarA ^gt-^ 

agcgtagatctggaacaytcccntacccnttcacacctggccctccctgg 
gcttaaagctcctgatattcctcatccccttccttgttttccaq AATCGG 
TCG AGGTGG ACGGTTTGGCflGT A A AGGTGT GGCT A TT A A P A TGGTG A P A P, 

AAGAAGACAAGAGGANTCTTCGAGACATT GAGArrTTrTArAAPArrTrr 
ATTGAGGAAATGCCCCTrAATGTTGrT GACCTrATrTGAr;r!nnrTr, T rp T 

GCCACCCA.qCCCCAGCGAGGGCTCAAAGT CTGGGGGrTr.AGGArrT^rA^ 
GAGGGGGGAGGGAAGGGAGCCAAGGGA TGGACATPTTCTrATTTTTyTT'r 

CTTTCAATAAATGTrACTTTTTGAGGGAAAAGAAGGAArrnTn^ APA TTT 
TAGACArrrTTTTCTTTGGOGTAGGCTrT TCcrrrAr,nrnccGGrTPTTr 

TCCCAAAAAAAAAAAAAAAArACTAATCCATTTGrTTAAPPTA n TA A^ T 
CC AG ATC rr A GAGGCTCTCCT C A CC TCAGC . TG AGfTPPTT'Tre a a a <~!<j>a a t 

TCAA^GftrTATGTCACrCAGCCTCATTTGCTGGAPPAaATPTq p.aP-P-P-A 
GAACCXTTA AAACrrCTA AGTGAGGTTGGrGAGGr!P,nTT O TPPPPAP T q T ^ 
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GGGGGAAGCAGGGGAGAGAAAATGGTAGCCATTT TTArATTnTTTTn Tft T 

AGTATTTATTGATTCAGGAAACAAACACAAAATTCTGAATAAAATGAPTT 
GGAAACTGCCTGTTTGGGCTTrTrATTTrTTArrTPrrrTTnPr T r TCCC 

ACCTGCTACTGGGTGCATCTCTGCTCCCCCCTTCCCCAGCAGATGGTT 

ACCTTTGGGCTGTTGCTTTCTTGTCACCATCTGAGTTCTCAGACGCTGGA 

AAGCCATGTTCTCGGCTCTGTGAATGACAATGCTGACTGGAGTGCTGCC 

CCTCTGTAAAGGGCTGGGTGTGGATGGTCACAAGCCCCTCACATGCCTCA 

GCCAAGAGGAAGTAGTACAGGGGTCAGCCCAGAGGTCCAGGGGAAAGGAG 

TGGAAACCGATTTCCCCACCAAGGGAGGGGCCTGTACCTCAGCTGTTCC 

CATAGCTACTTGCCACAACTGCCAAGCAAGTTTCGCTGAGTTTGACACAT 

GGATCCCTGTGGATCAACTGCCCTAGGACTCCGTTTGCACCCATGTGA 

CACTGTTGACTTTGCCCTGACGAA . GCAGGGCCAACAGTCCCCTAACTTA 

ATTACAAAAACTAATGACTAAGAGAGAGGTGGCTAGAGCTGAGGCCCCTG 

AGTCAGGCTGTGGGTGGGATCATCTCCAGTACAGGAAGTGAGACTTTCA 

TTTCCTCCTTTCCAAGAGAGGGCTGAGGGAGCAGGGTTGAGCAACTGG 

TGCAGACAGCCTAGCTGGACTTTGGGTGAGGCGGTTCAGCC ATGAGGC 

TGGCTGTGCTTTTCTCGG GGGCCCTGCTGGGGCTACTGGPAn gt aaggag 

gaaggaggctgaggggagggggcccctgggagggagcctgccctggg 

ttactaaccatctcctctctaccaaaaaCCCAGGGGACAGanAAT 
GACTGTCCTCACAAAAAATCAGCTACTTTGCTGCCA TCCTTnArnGTHA 

CACCCACGGTTACAGAGAGCACTGGAACAACGAGr CACAGGArTArrAAn 

AGCCACAAAACCACCACTCACAGGACAACCACCACAG GCACCACrAr.r 
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CACGGACCCA rrjArTCCCACTCACAACCCCACCACCACCAGCCATGGA 

AACGTCACAG TTrATrrAAGAAGCAATAGCACTGCCACCAGCCAGGGACC 

CTCAACTGCnArTGACAGTCCTGCCACCACTAGTGATGGAAATGGCAGGG 

TTCATCCAA rAAGrAAGAGrACTGCCA CCAGCCCAGGATTCACCAGTTCT 

GCCCACCCAG AACnACrrCCACCCTCTCGGAGTCCTAG CCCAACCTCrA 

AGGAGACCATTGGAGACTACACGTGGACCAATGGTTCCCAGCCCTGTGT 

CGAGCTGCA AGCCCAGATTCAGATTCGAGTCATGTACACAACCGAGGGTG 

GAgGAGAGgtaaagctaaaactgggggatgagaggggagggaggcaggac 
tggttataggctcagagggaagaaggaagaggggacaggnaaccttggcc 
ggcatcgcatgcagtcttgtgaccttccagtctttaacttccgcagSSCl 

GGGGTATCTCTGTNCTGANCCCCAACAGAACCAAGGTCCAGGGAAGCTOT 

GGGGGTGCCCATCCCCACCTGCTTCTCTCATTCCCCTATGGACACCTCAG 

CTTTGGATTCATGCAG gtatagccatgacctcagtctcacccctcacr.ca 

gcctcccggcgcccctcccctcccaatcccacacgctactccttcctctg 

tQgacraaaaataccacctacaccttcctcttcqccccacag GACCTCC 

AGCAGAAGGTTGTCTACCTGAGCTACATGGCGGTGGAGTACAATGTGTH 

CTTCCCCCACGCAGCAC gtaagtaacntcc!ttgcr!tt:tc:fcat-.1-gr.f^r-n 

actagacgccagggttcctgaaaggactaagctggggccagggaggtgg 
atacraatctaacccttcctcactcctccag AGTGGACATTCTCGGCTrAn 

AATGCATCCCTTCGAGATCTCCAAGCACCCCTGGGGCAGAGCTTrAnTTn 

CAGCAAGT CGAGCATCATTCTTTCACCAGCTGTCCAC CTCGArrTnrTr T 
CCCTGAGGCTCCAGGCTGCTCAGCTGCCCCACACAGGGGTCTTTnnnrAft 

&gtaagacctacctactccttccctcctagaatcctcccactgcactgaa 
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aaccccttccccaggcccataagccactcatctctcttcttaacccccca 
aatctcgctctcccagcttgtcatggctacagggcagctttctttccatc 
ctctacaagactctgccagtttcccccttttatcactgctgagtcactgc 
ggtgagctcctcaccaatctcctactccccagcatccccccattccctcc 
tcccacctttatcccaaccagcacgtcactgcaaatacctacctgcccta 

tccttccaccaaGTTTnTCCTnrrrrA GTGiACroriTCCATrTTarTa^r T 
CTCATCATCGGCCTGATrCTTrTTnflr CTCCTCGCCCTOGTrirTTATTnr 

TTTCTGCATCATGCGGAGAGGrrGATrr GCCTACGAGGCCCTCTGAGGAT 

T TGCTTCAAACCCCAGGGCACTGAGGGGGTTGGGGTGTGGTGGGGGGGT 

ACCCTTATTTCGTGGAGAGGrAArTGGr TCAAAGAGAATGTTATTTTGrT 

TCCCTTTCTTGAAGAACAAAAAGAAAGG CGGGGATGAGGGCTCATGrrTG 

TAATCCCAGCAGTTTGGGAGGGTGAGGr AGGTGGATGACTGGAGGTrAGG 

i 

AGTTTGAG ACCAGCCTGGCGAAGATaG TGAAACC CTGTGTrTAPTa ft a a & 
TACAATTAGGCAGGTGTG GCGGCGTAATCGGAGCTGGCCTGTAATrrran 
CTACTTGGGAGGGTGAGGGAGAAGTGGTTGAAGGGAGGA GGTGGAGGTTG 
CAGTGAGCrGTCATGGnGrrAGTGAGrra AGAGTCGCGCCAOTGGArTrr 
AGCCTGGGCGACAGAGGCAGAGTGTrTrAAATAAATA AATATGAGATAAT 
G CAGT CGGG AG AAGGG AGGG AG AG A ATTT TATTAA ATGTG ACG A A CTG O r 
CCCCCCrrrCCGCrrGAGGAGGAGAGGAG CAAAATTTATGCAAATrTTTG 
ACGGGGTTTTCCTTGTCCTGCCAGGATTAAAAGCCATGAGTTTrTTnTra 
CATGCCTTTCTATGCCTTCCATGGCTGGGTCTCAGGGAGCrGGaanranr 
TGCTGAGGAGGGATGAAAATGTCAGTGTGTGACGAT GCCTGATPir^TTra 
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CCCCCCAAAGCCTGGCACAGCTGnTO TTGGGTCTGCCGTGrdTrrrTTrr 

TTCCTCCTCTTGGGGCCACTGGCTGCTCGAGTTG CCCATGGGTGGGAAGr 

CGGTAGAGCGATTGATGGCCGCAGCG TTCTTCCTGACGCTrGTAraGTTT 

C AAATGCAGCAG ACAGGf! A A AGGA ATG AGTGGGGGGCTGTGG A A CTT ("* A T 

TCCCAAAGGCAGCGCCAGTGGCTCCTGAGCAATGAGA ATGTGrTGTrr T r, 

TCCACCATATTCAAGGCCAGCAGAAGAG CCCGATTAAACCGTrGrAGrGft 

CCTGGCATGCTCCTATCCCACCTGCAAGGGGTTGAATGAAG AAGGAGrAG 

TGGGTACTGTGACGTGGArTGGGGGrTr CTGGGAArAGCATGrrrrrrAr 

ACGGGGCCACCTGCGAAGGGTAAGTTrAT GCCGGGAGTArTTGAGATGAG 

GAGTG TCACTCTCAGGArAGCCAAGGTrrAGATTGTAGAAAGGAGGTrrr 

AGATGGCCAGAGGGTGGAGrAGrAGTGp | G, CGCCAGTrrrAGGrATTAraG 

CTGGCTACGGCGGAATGGrTGGGAGrraG GATGAGGAGrArrrrrrAGrr 

GTAGGA GCCCCAGGAGGCTTCCGGCTTCCAAGGCCMAGAGACTGrrrPAr 

AAGGSAGCCCTCACCTGGGAGGGGGrAGr AAGCCGCACrTCTTGrrTGraG 

ACATCCGTGTGACCTTGTAGACTTTGGAG GGGGGGCGPAAAGGGGTGATr 

CACAGCGGAATGAGGrArGGGTGGGrArr GTGGGTTGGGGTPGGGGrGGT 

CGGTAACGA AGPJl G A A rGPPrrr a A GGT AGTrr A A G GTGCPPTTTTPP 

CCAAGCAGnGCTGMAGGKTrMATr'TGGfr TCGCCGAAGTCTGTGrrrAGr 

ACGTGGT ^rGCCTCGGGCCAGCGGGGAGGGGGGGAATrrrGGGArTGrTr 1 

GCCAGGC CTCGGCTCCCCGGAGACTCTTGGGGGTSTGGGCCCCAAGGGTG 

ATTCAGGTGCTGGcrTTKrrrrGArrTGGG ATGrTTrrrrcAr.GTrTTrT 

TTTGTTTTAATGTrrrGGr T CCCAGCAGTT GCCGGCGGAATTCATGVTrrr, 

AGGCCTGftGr CAACCGGAGGCGAGArAAGrACAGGGrrrTGrGrGraarr 
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CCCTTCCCTCTAGAGArGCTGAGAGaar GGGAGCTAGTAGCGrrrrrarr 
CAACGCCAOCTGGGAGArTrrGGrTrrTT rTCTrTrAArTTGGAAPAATa 
CAAA GTGTGCTAGGAGAAGACAAGftTGGCGCCCAGrAGnAGGAGrGGAr;a 
MG GCAGGGGTGTAAATCTGGCTTCnAAArTGGAAGrGTraA CAAAGGrn 

TGGGAGGTCTAACrGrGrAGnrnTnrar T (7 TTCGGCAAGr TT 
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Sequence ID No 4 

ctagctggtctgagcatctctgcc ATGCGGCTC P.nTfiTGTGTCTGATCTTGCTAGGACCGC 
TTATAG qtaaqgagaaatgggaggtgggggagggagggctcatgggcaggagcctgcacctgggtggcc 
aaccaatctcttactgaaag CCCAAGGAACAGAGGAAGACTGTCCTCACAAAAAGGCCG 
TTAnTrTCCTGCCATCCTTCACGATGACACCTACAGCCACAGAAAGCACAGCCA 
GnnCTACGAnnAGnCACAGGCCCACCAC CACCAGTCACGGAAATGTCACAGT 

TnACACCAGCTCCGGACCCAGAACTGTCACTCATAACCCTGCCACCACC ACC 

AGTCATGGGAATGCCACAATITCTCATGCCACAGTTTCTCCCACCACAAATGGC 

ACTGCTACTAGTCCAAGATr .rTCCACTGTTGGCCCTCACCCTGGACCACCTCC 

ACGCTCGCQTAGTCCAAGGTnCAAGGG GGCTCTTGGGAACTACACGTGGGCCA 

ACGGCTCC C ATCCTTGTGTTC AGCTC C AAG C CC AAATTC AAATCC G AATC CTAT 

ACXCAATTCAGGGTGGAAGAAAG qtaaaqctaaaqtggggcttaaagagggcaagaggcaa 

gtcctgggctcgttcagcagggaagaggaagagaagaggaggggataaactggatggagcattcttgtgattt 

cagacccaccattgcacttctacag GGTTGGGGCATATCTGTTTTGAATCCCAACAAAACC 
AAGGTCCAGGGAGGTTGTGACGGTACCCATCCCCACCTGTCTCTCTCATTTCC 

TTATGGACAGCTTACOTTTGGATTOAAACAG qtatacaqcttqaqtttqtctctatcctctattcttcc 
atatcccatacctgtacccccggagcctctgttcttgctctgtggacatggatgcctctgtccctgatgccttgagtctttyt 
gttcaccttaag GACCTACATGAGAGCCCGAGTACAGTCTACCTGGACTACATGGCG 
GTGGAATACAATGTGTnCTTCCCACAGGCAGCAC gtgagtaatctcttctccttaccacacta 
aaagtctaggctgggcgtgctgggctggtggggaggactcaggagtcaggactggatttgactcttaattactaat 
tactgcag AGTGGACATTCATGGCGCAGAATTCATCTCTTCGAGAGCTCCAAGCTC 
CCTTGGGCCAAAGCTTCTGCTGTGGAAATGCAAGCATAGTTCTTTCTCCAGCTG 
TTCACCTTGACCTGCTCTCTCTAAGGCTACAGGCTGCTCAGC TGr.r.TGACAAGG 
GACACTTCGGGCCAT gtaagccctacctacttcttctttcctagagctctcccagtgctctggaaaccttccc 



SUBSTITUTE SHEET (RULE 26) 



WO 97/42337 



50 



PCT/GB97/01209 



cagaactctttttctagacctccgcctcctctacaagactgccttatttcccctttgctgactgctctcacctatgggttggc 
tcctcagcctttcttcatcccctttcctcttccgtcctcccccacctcccacctttagcccgtccaccactgcagcaattct 
cagggatcccattattctgccag GTTTCT CTTGCAACCQTRACCAGTr.r.r.T CTTfinTnr.r 

TCTCAT CATTGGCCTGGTCCTCCTCGGCCTCCTCAnnnTG GTGr.Tr.ATr.nrr 

nCTGCATCACCCGCAGACGACAATCAACCT A CCAGCcnr.Tr.T n A gcatctgcccc 
agtccactgtg 
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